A Smith-Waterman Algorithm Accelerator Based on Residue Number System

نویسندگان

  • Kwame O. Boateng
  • Edward Y. Baagyere
چکیده

One of the biggest challenges confronting the bioinformatics community is fast and accurate sequence alignment. The Smith-Waterman algorithm (SWA) is one of the several algorithms used in addressing some of these challenges. Though very sensitive in doing sequence alignment, SWA is not used in real life applications due to the computational cost associated with the software implementation. Heuristics methods such as BLAST and FASTA are used, though they do not guarantee accurate sequence alignments. In this paper, we proposed a novel accelerator for addressing the challenge using Residue Number System (RNS). RNS is an integer system with properties that support parallel computation, carry-free addition, borrow-free subtraction, and singlestep multiplication (without partial product). Based on some of these properties, the design of a hardware accelerator for SWA is presented on the assumption that two long strings of DNA can be compared in a divide-andconquer manner. Simulation of a sample design indicates modest hardware consumption and much improved overall speed of the SWA. Introduction The origin of Residue Number System (RNS) can be traced to the puzzle given by Sun Tzu, a Chinese Mathematician and is illustrated as follows: How can we determine a number that has the remainders 2, 3, and 2 when divided by the numbers 7, 5, and 3, respectively? This puzzle, written in the form of a verse in the third century book, Suan -ching by the Chinese scholar Sun Tsu, is perhaps the first documented use of number representation using multiple residues. The answer to this puzzle, 23, is outlined in Sun Tzu’s historic work. The puzzle essentially asked us to 100 Kwame O. Boateng and Edward Y. Baagyere convert the residues ( ) 2 3 2 RNS(7|5|3) into its decimal equivalent. Sun Tsu formulated a method for manipulating these remainders (also known as residues), into integers. This method is regarded today as the Chinese Remainder Theorem (CRT). The CRT, as well as the theory of residue numbers, was set forth in the 19 century by Carl Friedrich Gauss in his celebrated Disquisitiones Arithmetical [1]. This over 1700 – year old number system is making waves in computing recently. Digital systems implemented on residue arithmetic units may play an important role in ultra – speed, dedicated, real – time systems that support pure parallel processing of integer – value data due to its inherent features such as carry free addition, borrow free subtraction, single step multiplication without partial product, parallelisms, and fault tolerant. These interesting properties of RNS have lead to its widespread usage in Digital Signal Processing (DSP) applications such as digital filtering [2], [3], [12], convolution, correlation, Fast Fourier Transform (FFT)[2], [18],[19]. Discrete Cosine Transform (DCT)[4], [5], image processing, cryptography, digital communications[16], [17] and other highly intensive arithmetic applications[8], [18]. However, RNS has not found wide spread usage in general purpose processors due to difficulties associated with magnitude comparison, sign representation, overflow detection, data conversion, moduli selection, division, and other complex arithmetic operations[6], [10], [11], [13]. RNS is defined in terms of a relatively – prime moduli set { }, ..., 3 , 2 , 1 n m m m m that is GCD ( ) 1 , = j m i m for j i≠ , where GCD means greatest common divisor. A binary number X can be represented by the residues ( ) n x x x x ,..., 3 , 2 , 1 , where i x i m X mod = , i m i x < ≤ 0 . Such a representation is unique for any integer [ ] 1 , 0 − ∈ M X , where

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved RNS Reverse Converter in Three-Moduli Set

Residue Number System (RNS) is a carry-free and non-weighed integer system. In this paper an improved three-moduli set  in reverse converter based on CRT algorithm is proposed. CRT algorithm can perform a better delay and hardware implementation in modules via other algorithms. This moduli is based on p that covers a wide range on modules and supports the whole range of its modules in dynamic r...

متن کامل

Striped Smith-Waterman speeds database searches six times over other SIMD implementations

MOTIVATION The only algorithm guaranteed to find the optimal local alignment is the Smith-Waterman. It is also one of the slowest due to the number of computations required for the search. To speed up the algorithm, Single-Instruction Multiple-Data (SIMD) instructions have been used to parallelize the algorithm at the instruction level. RESULTS A faster implementation of the Smith-Waterman al...

متن کامل

Hardware accelerator architecture for simultaneous short-read DNA sequences alignment with enhanced traceback phase

Dynamic programming algorithms are widely used to find the optimal sequence alignment between any two DNA sequences. This manuscript presents a new, flexible and scalable hardware accelerator architecture to speedup the implementation of the frequently used Smith–Waterman algorithm. When integrated with a general purpose processor, the developed accelerator significantly reduces the computation...

متن کامل

Comparison of methods for searching protein sequence databases.

We have compared commonly used sequence comparison algorithms, scoring matrices, and gap penalties using a method that identifies statistically significant differences in performance. Search sensitivity with either the Smith-Waterman algorithm or FASTA is significantly improved by using modern scoring matrices, such as BLOSUM45-55, and optimized gap penalties instead of the conventional PAM250 ...

متن کامل

In-socket Fpga Implementation of Bioinformatic Algorithms Using the Intel Aal

We demonstrate the implementation of two important bioinformatic algorithms in the XtremeData XD2000i FSB module using the Intel Accelerator Abstraction Layer (AAL). We have modified SSEARCH35 and NCBI BLAST, industry standard open-source implementations of the Smith-Waterman and BLAST algorithms respectively, to transparently introduce a hardware accelerated option to users.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011